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ABSTRACT 

Context. Identifying frequencies with low signal-to-noise ratios in time series of stellar photometry and spectroscopy, and measuring 
their amplitude ratios and peak widths accurately, are critical goals for asteroseismology. These are also challenges for time series 
with gaps or whose data are not sampled at a constant rate, even with modern Discrete Fourier Transform (DFT) software. Also the 
False-Alarm Probability introduced by Lomb and Scargle is an approximation which becomes less reliable in time series with longer 
data gaps. 

Aims. A rigorous statistical treatment of how to determine the significance of a peak in a DFT, called SigSpec, is presented here. 
SigSpec is based on an analytical solution of the probability that a DFT peak of a given amplitude does not arise from white noise in 
a non-equally spaced data set. 

Methods. The underlying Probability Density Function (PDF) of the amplitude spectrum generated by white noise can be derived 
explicitly if both frequency and phase are incorporated into the solution. In this paper, I define and evaluate an unbiased statistical 
estimator, the "spectral significance", which depends on frequency, amplitude, and phase in the DFT, and which takes into account 
the time-domain sampling. 

Results. I also compare this estimator to results from other well established techniques and assess the advantages of SigSpec, through 
comparison of its analytical solutions to the results of extensive numerical calculations. According to those tests, SigSpec obtains 
as accurate frequency values as a least-squares fit of sinusoids to data, and is less susceptible to aliasing than the Lomb-Scargle 
Periodogram, other DFTs, and Phase Dispersion Minimisation (PDM). I demonstrate the effectiveness of SigSpec with a few examples 
of ground- and space-based photometric data, illustratring how SigSpec deals with the effects of noise and time-domain sampling in 
determining significant frequencies. 

Key words, methods: data analysis - methods: statistical 



1. Overview 



In this paper I provide a brief introduction to Fourier methods 
in astronomical time series analysis (Section|2|, outline existing 
statistical approaches (Section[3]), and address the major weak- 
nesses in available techniques (Section|4|. 

While Sections|2] to [4] contain previously available (text- 
book) information, I introduce a new and unbiased reliability 
criterion (spectral significance) based on theoretical statistics in 
Section|5] This section also addresses the correspondence be- 
tween the spectral significance and other reliability estimators. 
Section|6]is devoted to the comparison of the analytically de- 
duced spectral significance to the results of numerical simula- 
tions. Finally, the results of comparative tests of spectral signif- 
icance computation vs. other period detection methods are pre- 
sented. 

An example for the practical application of the new method 
vs. a widely used standard procedure is provided in Section|7] It 
may be useful for the non-mathematically oriented reader who 
is mainly interested in how the technique performs in 'real life' . 

Further topics (Section[8j are the application of statistical 
weights, the fact that a time series consists of individual subsets, 
and potential problems with colored vs. white noise. 



2. Introduction 

In Fourier Analysis a continuous function of time over a finite 
time interval is expanded into a series of sine waves. These 
waves represent a superposition of an oscillation at a fundamen- 
tal frequency and a discrete, generally infinite set of overtones. 
The fundamental frequency is determined by the reciprocal time 
interval width. All other frequencies correspond to integer mul- 
tiples of this fundamental. The knowledge of the amplitudes and 
phase angles for all frequency components permits one to en- 
tirely recover the given function in the time domain. 

Practical applications (such as astronomical observations) 
generally deal with discrete sets of measurements (time series) 
rather than continuous functions of time and, on the other hand, 
consider the Fourier Spectrum as a continuous function of fre- 
quency rather than a discrete dataset. This leads to the Discrete 
Fourier Transform (DFT). It allows one to determine the domi- 
nant frequencies of the observed physical process with a higher 
frequency resolution than is possible with Fourier Analysis. 

Motivated by the desire to understand physical oscillations, 
the scientist is interested in a couple of eigenfrequencies and the 
exact determination of related amplitudes and phases rather than 
the complete signal recovery. In practical applications, these are 
considered to correspond to local maxima (peaks) in the ampli- 
tude spectrum. A widely held strategy is to 

1 . compute an amplitude spectrum for the given dataset, 
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2. identify the maximum amplitude within the frequency range 
of interest, 

3. decide whether this amplitude is 'significant', 

4. subtract the corresponding sinusoidal signal from the time 
series, and 

5. use the residuals after subtracting the fit from the time series 
for the next iteration. 

This procedure is to be understood as a loop, terminated if the 
maximum amplitude is not considered significant any more. The 
result of these consecutively performed prewhitenings is a list of 
frequencies, amplitudes, and phase angles, plus a residual time 
series (hopefully) representing the pure observational noise. In 
fact, part of this noise is due to measurement errors, but fre- 
quently merged with signal components the amplitudes of which 
are too weak to be detected. As an example, the number of 
photometrically resolved frequencies in the 6 Set star 4 C Vn in- 
creased from 5 to 34 between 1990 and 1999 (Breger et al. 1990, 
1999). 

In many cases, the results of the prewhitening are subject 
to a multiperiodic least-squares fit (e.g., Sperl 1998; Lenz & 
Breger 2005). This represents a fine tuning to adjust frequencies, 
amplitudes, and phases to a minimum rms residual, but may lead 
to the exclusion of some terms, or inclusion of new terms. 

The eigenfrequencies, amplitudes, and phases of stellar os- 
cillations provide fundamental information on the distribution of 
mass and temperature, the radiative and convective energy trans- 
port, or abundances of elements in the stellar atmosphere and 
help to determine fundamental parameters such as mass, radius, 
effective temperature, rotational velocity, and age of a star. 

3. Statistical aspects 

In practical applications, a signal is not only of stellar origin but 
a superposition of the information received from the star, instru- 
mental (pseudo-)periodicities (e.g., invoked by thermal effects), 
and transparency variations in the Earth's atmosphere. To elimi- 
nate the third component, observations with instruments aboard 
of spacecraft have been used increasingly by the astronomical 
community during the past two decades. Unfortunately, stray 
light scattered from the illuminated surface of the Earth intro- 
duces additional quasi-periodic artifacts that are not easy to han- 
dle and hence represent the major constraint to the accuracy of 
space-based data acquisition (Reegen et al. 2006). 

An unbiased criterion to decide, whether a peak amplitude is 
generated by noise or represents an intrinsic variation, is impor- 
tant, because the choice of the most significant peak determines 
all further iterations in the prewhitening sequence. A falsely 
identified signal component usually perturbs all results obtained 
subsequently. 

In addition, Scargle (1982) points out that Gaussian noise 
in the time domain may produce a peak of arbitrary amplitude 
in the DFT spectrum. Since there is no natural upper limit to 
amplitudes produced by noise, the danger of misinterpretation is 
imminent, and the 'significance' of an examined peak needs to 
be described by a probability distribution. 

Of course, the probability that white noise produces a low- 
amplitude peak is higher than that for a high-amplitude peak. In 
other words, the False-Alarm Probability that the highest peak 
in an amplitude spectrum is an artifact due to noise is lower for 
higher amplitudes. This definition is reasonable because it relies 
on the highest available peak, since one cannot trust any peak, 
if even the one with the highest amplitude is unreliable. In this 
perspective, the False-Alarm Probability appears to be a good 



criterion of whether to believe in the presence of a signal in a 
given dataset. 

The statistical description of the False-Alarm Probability re- 
lies on a Probability Density Function (PDF), which is the con- 
tinuous version of a histogram, as the bins of which become in- 
finitesimally small. Thus the False-Alarm Probability is an in- 
tegral over the PDF. In many statistical applications, probability 
distributions may easily be described by the PDF, but an analytic 
expression of the integral does not exist, as e. g. for the Gaussian 
distribution. Hitherto many statistical problems are solved in 
terms of PDFs rather than of the corresponding cumulative quan- 
tities. 

The PDF of the amplitude spectrum generated by pure 
Gaussian noise at equidistant sampling was deduced by Schuster 
(1898; also Scargle 1982). For non-equidistant sampling, the 
Lomb-Scargle Periodogram (Lomb 1976) is claimed to provide a 
better statistical behavior (Scargle 1982). Koen (1990) examined 
the application of the Fisher test (1929) to the Lomb-Scargle 
Periodogram to estimate peak significance, and Schwarzenberg- 
Czerny (1996) combined Lomb's solution with the analysis of 
variance method (Schwarzenberg-Czerny 1989) to obtain a pow- 
erful period search technique for non-equidistant data. 

A widespread, but purely empirical approach is the consid- 
eration of a peak in the amplitude spectrum to be 'real', if its 
amplitude signal-to-noise ratio exceeds a given limit. Breger et 
al. (1993) suggest a value of 4. Numerical simulations for 19 300 
synthetic time series by Kuschnig et al. (1997) return an associ- 
ated False-Alarm Probability of 10~ 3 for 1 000 data points. 

An alternative way of determining periods was intro- 
duced by Lafler & Kinman (1965) and statistically examined 
by Stellingwerf (1978). The Phase Dispersion Minimization 
method is based on the assumption that the correct period would 
produce a phase diagram with the lowest intrinsic scatter. This 
algorithm is a powerful tool especially for non-sinusoidal peri- 
odicities, e. g., if the stellar surface structure is observed pho- 
tometrically as the star rotates. In this situation, the advantage 
compared to Fourier techniques is the simultaneous treatment of 
all overtones that determine the shape of the signal. 

In the case of multiperiodic variability, PDM loses accuracy, 
since the phase diagram for one frequency is contaminated by 
all others, unless they are integer multiples of this frequency. 

4. Problems 

A closer examination of the fundamental properties of the DFT 
leads to the following issues: 

1. The peak frequency in the amplitude spectrum of a single 
sinusoidal signal is not recovered correctly (Kovacs 1980). 
This is a side effect of the restriction to a finite time inter- 
val rather than a property of non-equidistant sampling. Also 
the Lomb-Scargle Periodogram suffers from systematic peak 
frequency displacements (see |6.21 >. 

2. The 'entire' spectrum is defined by the Nyquist (Critical) 
Frequency as the upper limit for equidistantly sampled data 
(Whittaker 1915; Nyquist 1928; Kotelnikov 1933; Shannon 
1949; Press et al. 1992). In the case of non-equidistant sam- 
pling, there is no unique definition of such a limit. Home & 
Baliunas (1986) suggest the average sampling interval width 
for the calculation of the Nyquist Frequency; Scargle (1982) 
and Press et al. (1992) promote the minimum sampling in- 
terval; a variant discussed by Eyer & Bartholdi (1999) is to 
pad the entire dataset with zeros to achieve equidistant, but 
much denser sampling in the time domain and to take the 
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resulting sampling interval as a Nyquist Frequency estima- 
tor. The most promising method was introduced by Sperl 
(1998): each sampling interval of the non-equidistant se- 
ries is considered responsible for its own, individual Nyquist 
Frequency. In a subsequent step, a histogram of all individ- 
ual Nyquist Frequencies helps to decide where to define a 
healthy upper frequency limit. 
3. Periodic gaps in the time-domain sampling introduce peri- 
odicities in the Fourier Spectrum. In this case, a single si- 
nusoidal signal is represented by a peak that is accompanied 
by several aliases. In order to overcome the systematic ef- 
fects of non-equidistant sampling in the frequency domain, 
Ferraz-Mello (1981) examined the possibility of incorporat- 
ing the sampling properties into the DFT amplitude spectrum 
by introducing appropriate statistical weights. His study led 
to the design of harmonic filters, which may help to improve 
the selection of peaks. A different approach is the consider- 
ation of the entire 'comb' of aliases instead of the peak with 
the highest amplitude only (Roberts, Lehar & Dreher 1987; 
Foster 1995). 

The goals of the present paper are 

- to deduce the depedence of the Probability Density Function 
of non-equidistantly sampled Gaussian noise on frequency 
and phase (Section[5]l, 

- to introduce spectral significance as a measure of False- 
Alarm Probability ( 15. 5t , 

- to compare the theoretical solution with the results of numer- 
ical simulations (16. U . and 

- to quantitatively oppose the accuracy of the peak frequencies 
using spectral significance (16.21 16.31 l vs. the 'simple' DFT 
amplitude, the Lomb-Scargle Periodogram, PDM, and the 
DFT amplitude plus improvement of resulting peak frequen- 
cies by least-squares fitting. 

A frequent problem in the context of astronomical observa- 
tions is that all measurements do not necessarily have the same 
variance. This may be due to different instrumentation for dif- 
ferent subsets of the time series or changing environmental con- 
ditions, such as thermal noise or transparency variations in the 
Earth's atmosphere. Statistical weights are introduced into the 
spectral significance in order to take appropriate account of the 
variable quality of measurements within a single dataset ( 18. U . 

5. Amplitude PDF for non-equidistant sampling 

This section presents the theoretical evaluation of the frequency- 
domain PDF for a non-equidistantly sampled time series repre- 
senting Gaussian noise. 

A very important fact for the statistical analysis is, that in 
most practical applications, the mean value of the observable is 
shifted to a fixed value (frequently zero) before the DFT is eval- 
uated. The statistical description of frequency-domain noise has 
to take this fact into account. 

In statistical terms, the Fourier Coefficients are regarded as 
weighted sums of random variables. Such sums tend towards a 
Gaussian distribution as datasets become large enough. Since the 
Fourier Coefficients for each frequency represent the Cartesian 
components of the two-dimensional Fourier Space, the result- 
ing Gaussian distribution will be two-dimensional (bivariate) as 
well. Furthermore, as the Fourier Coefficients are functions of 
frequency, the considered Fourier-Space probability distribution 
will depend on frequency, correspondingly. 
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Fig. 1. Schematic illustration of a time series with periodic 
gaps representing Gaussian noise with a standard deviation 
equal 1 (top) to visualize the dependency of the corresponding 
frequency-domain noise on frequency and phase, as well as on 
the characteristics of the time-domain sampling. In the two ex- 
amples referred to in the lower panels, a test signal is displayed 
(dashed lines) for which the DFT shall be evaluated. If the fre- 
quency and phase are combined such that the data points con- 
sistently align around the times when the trigonometric func- 
tion attains a small value, only a small fraction of the time- 
domain noise will be transformed into the frequency domain 
(mid). For a combination of frequency and phase grouping the 
data points around the maxima and minima of the test signal, 
the time-domain noise will produce a higher frequency-domain 
noise, correspondingly (bottom). The first of these two cases pro- 
duces a narrower probability distribution in Fourier Space, and 
consequently, a signal with the same amplitude will be consid- 
ered more reliable in the first case than in the second. 



For a time series with gap^] (Fig.Q~|l, the relevant influence of 
the time-domain sampling on the frequency-domain probability 
distribution is determined by the phase coverage of the measure- 
ments: a combination of frequency and phase for which the inter- 
vals containing data are consistently associated to angles where 
the corresponding trigonometric function ('test signal') attains 
low numerical values will result in a lower noise level than a 
combination allocating the data close to angles where the max- 
ima/minima of the test signal are located. The first case will yield 
a narrow probability distribution, the second case a broad one. 
Consequently, a signal with the same amplitude has be consid- 
ered more significant in the first case. These strong phase depen- 
dencies are mitigated for frequencies providing a better phase 
coverage. 

Consequently, the amplitude distribution in Fourier Space 
will have to be a function of both frequency and phase, which 



1 The time base represents 10 nights of ground-based photometry of 
the star IC 4996 #89 (Zwintz et al. 2004; Zwintz & Weiss 2006). See 
|5.6| for further detail. 
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is achieved by expressing the bivariate Gaussian PDF in polar 
rather than Cartesian coordinates. 

The probability of a peak generated by noise to reach a given 
amplitude level may be evaluated through integration of the 
PDF over amplitude, which leads to the Cumulative Distribution 
Function ( CDF) and False-Alarm Probability, based on which a 
more informative quantity, the spectral significance, is defined. 



5.1. Zero-mean correction 

In astronomical applications, magnitudes are usually averaged 
to a pre-defined constant (zero or non-zero, as obtained by some 
theoretical concept or calibration) before the amplitude spectrum 
is evaluated. The following considerations apply to observables 
adjusted to zero mean. If a non-zero constant is chosen instead, 
the DFT will change, but the False-Alarm Probabilities will re- 
main the same. In this case, one prefers to evaluate a DFT spec- 
trum for the time series as it is but use a zero-mean corrected ver- 
sion of the dataset for the computation of spectral significances. 

Considering a time series Xk := x(fk) to be generated by a 
Gaussian random process with expected value and population 
variance (x 2 ^, the time-domain PDF is 



<P(xk) :- 



_±_ 

2( t 2 ) 



(1) 



Given a random process that produces an infinite population of 
Gaussian random variables, the mean of a finite sample of ran- 
dom variables Xk is free to scatter around the population mean. 
Gauss's law of error propagation returns a variance of the sample 
mean which is proportional to the inverse number of data points 
in the sample, 



Coefficients according tcQ 

1 K ~ l 1 
azmH := — 2j x k cos o>t k - ^ x k ^ cos cot, . 



A'-l 



k=0 
K-l 



bzM (w) 



k=0 



k=0 
K-l 



/=() 
K-l 



Xk sin utk 



K 2 



sin <±>t{ . 



(3) 



(4) 



/.=o 



Due to the linearity of the Fourier Transform, the subtraction of 
a constant in the time domain refers to a subtraction of a spectral 
window in the frequency domairQ. 
Rearrangement of indices yields 



i C=J 

«zm(w) = — 2^ 

k=0 

^zm(w) = — ^ 
k=0 



Xk 



Xk 



COS OJtk 



sin ojtk 



1 K ~ l 

--Y 

K-l 

--Y 

K 4-! 



K-l \ 
COS Utl 

1=0 
A'-l 

sin coti 

1=0 



(5) 



(6) 



Given pure Gaussian noise in the time domain with a popula- 
tion variance (* 2 ), eqs.[5] IS] allows one to consider both Fourier 
Coefficients as linear combinations of Gaussian variables with 
expected values and variances 



(< 



(x 2 \ K - { 



(^)(-) = yz 



sin utk 



1 K ~ X 



K-l \ 
COS Ut\ 

1=0 
A'-l 

sin utl 

1=0 



(7) 



(8) 



Thanks to the Central Limit Theorem (de Moivre 1718; 
Stuart & Ord 1994, pp. 31 Of), the consideration of these coef- 
ficients as Gaussian variables holds to a sufficient degree, even 
if the time-domain noise is not Gaussian, because even short 
datasets in astronomical applications are long enough compared 
to the fast convergence of the PDF towards the Gaussian distri- 
bution with an increasing number of random variables. 



(2) 



If the finite sample is artificially adjusted to zero average, the 
sample mean value is not allowed to scatter any more. Since the 
standard error of an individual data point implicitly contains the 
standard error of the mean, too, zero-mean correction will dis- 
tort the PDF of the random variable. The only exception with 
an invariant PDF is the Fourier Analysis, i. e., equidistant time- 
domain sampling and a set of discrete frequencies. 

An alternative (and promising) method in this context is the 
Floating-Mean Periodogram (Cumming, Marcy & Butler 1999), 
which is based on a least-squares fit of a sinusoid plus a constant 
to the time series, the latter retaining the free scatter of the sam- 
ple mean. 



5.2. Distribution of Fourier Coefficients 

Incorporating the effect of zero-mean correction into the sta- 
tistical examination, the zero-mean corrected magnitude values 
x k — j Jjk=o x k have to be used for the calculation of Fourier 



5.3. Frequency- and phase-dependent PDF 

Since the DFT produces a two-dimensional vector (a,b), the 
probability distribution in the frequency domain will also be 
two-dimensional, so-called bivariate. 

The combined probability density of two independent 
Gaussian variables a, f3 with corresponding variances (o" 2 ), (j3 2 ^j 
is given by a bivariate Gaussian PDF, 



1 



(9) 



2nyl{a 2 ){p 2 ) 
if the covariance (a/3) vanishes. 

2 Some applications prefer a different normalization of eqs.[5][6] E. g., 
in publications dealing with the theoretical aspects of functional anal- 
ysis, both Fourier Coefficients and the inverse transform from the fre- 
quency into the time domain are frequently normalized by K~i instead 
of A' -1 . Also in the field of communications engineering, different nor- 
malizations are used. In fact, one is free to distribute the normalization 
factors among these relations arbitrarily, as long as the product of both 
factors is K~ [ . 

3 Since the Fourier Analysis of equidistant time series is restricted 
to discrete frequencies associated to orthogonal DFTs, nothing will 
change for non-zero frequencies in this special case. 
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The fact that the Fourier Coefficients a^M, bzM of pure 
noise are two linear combinations of the same random vector 
in Fourier Space diminishes the degrees of freedom by 1 . Hence 
they may be considered independent to a sufficient degree, if the 
sample size is large enough. Consequently, if (azm^zm) = 0, 
eq.|9] describes the bivariate distribution of Fourier Coefficients 
related to noise satisfactorily. According to Appendix|A] rotating 
the Fourier Space coordinates by an angle 9q given by 



tan 20o (co) = 

K Zf~o sin 2^ 



K 2f=o cos 2tot k - (2X0 cos ^k) + (Zto sin cot k ) 



(10) 



transforms the Fourier Coefficients ozm, bzM into coefficients a, 
ft with zero covariance, as desired. 

The DFT of a measured time series Xk contains only an am- 
plitude A but also a phase angle 



>(*>) 



Xk sin (±>tk 



2f = o x k COS OJtk 



(11) 



This additional information may be taken into account by evalu- 
ating the conditional probability density of amplitude for a con- 
stant phase angle 9, pre-defined by the DFT of the time series 
under consideration at the frequency co. 

The transformation of eq.|9]from Cartesian into polar coor- 
dinates (A, 9) is performed via dad/3 - AdAd9 (Appendix IB.U 
with 



a = —cos (9- 6» ) , 
ft = ^-sin(0-#o) , 



(12) 
(13) 



where the fact, that the coordinate system is rotated by a con- 
stant angle 9q, does not contribute to the Jacobian of the trans- 
formation. The division by 2 is introduced by collecting the con- 
tributions of both A (co) and A (-co) - which are equal for real 
observables - to the total amplitude. The transformed amplitude 
PDF becomes 



>(A,9\ co) = 



2 {e-» ) , s in 2 (g-8 ) I 



2ttVVW> 



(« 2 > V) 



(14) 



Of course, <p does not only depend on amplitude A and phase 
9, but also on ao, fto, and 9q, which are determined by the time- 
domain sampling and are functions of frequency co. The bar sym- 
bol in <p (A, 9 1 cu) is introduced to formally separate random vari- 
ables to those considered constant. 

Changing the normalization condition from 



JjTj, dA d9</>(A,0\to) = 1 into j K dA<f>(A\ co,9) = 1 yields 



<P(A\co,9) = ^e" 
with 



R : = 



(a 2 ) (ft 2 ) 



\ (ft 2 ) cos 2 (9 - flb) + (a 2 ) sin 2 (9 - 9 ) 



(15) 



(16) 



The difference between eq.[14]and eq.[15]is that eq.[T4lis a bivari- 
ate PDF of amplitude and phase, whereas in eq.[l5]only the am- 
plitude A is considered as a random variable, and 9 is constant. 
This relation returns the probability density of amplitude for a 



fixed frequency and a fixed phase in Fourier Space. Accordingly, 
the PDF is normalized by the condition that its integral over the 
entire amplitude range (from to 00) has to be 1 . Furthermore, 
amplitudes being defined > introduce a factor 2 into the argu- 
ment of the exponential function. 

Eq. [16] presents R(9) as an ellipse in polar coordinates. The 
semi-major and semi-minor axes are ~<J(a 2 ) (eq. IA.3b and ^(ft 2 ) 



(eq. IA.4l i. respectively, and the orientation is determined by 9q. 
This ellipse will be called the rms error ellipse. Its orientation 
and dimensions depend on frequency. 

Eq.[10]has got a set of solutions for 6q assigned to orthogonal 
directions: if 6>o is a solution, then the complete set of solutions 
is #0 + f Vz € Z- Whether ^j(a 2 ) returns the semi-major axis 

and y/(J3 2 ) the semi-minor axis of the rms error ellipse, or vice 
versa, depends on the choice of This paper consistently uses 
solutions of #0 that assign ao to semi-minor axes, which yields 
the maximum spectral significance for all phase angles under 
consideration. 

As shown later (see 15.61 ), the introduction of the normalized 
semi-major and semi-minor axes, 



a (co,9 ):= J2K 



(a 2 ) 



A'-l 



A jF2 I K Yj CO&1 ^ wtk ~ 0o - ) ~ Yj COS ^ ~ 00 ^ 

\ I k=0 [l=0 



K-\ 



(17) 



ft (a),eo) := MK 



-2 itf^sin 2 (cot k -9 )- 

k=Q 



K-\ 



2^ sin (cot, - 9 ) 

1=0 



(18) 



respectively, provides the separation of sampling-dependent 
quantities from quantities that only depend on the DFT ampli- 
tude. In this context, the term 'normalized' means that for ar- 
guments cotk - &o uniformly distributed on [0, 2n], the expected 
values of both ao and fto are 1 . 

Given the orientation of axes, 9q, and the normalized axes, 
ao,fto, of the ellipse, the standard deviation for an arbitrary phase 
9 in Fourier Space is the radius of an ellipse in polar coordinates 
(R, 9) according to 



R 



2K ft 1 cos 2 (9 - 9 ) + a 2 sin 2 (9 - 9 ) 



(19) 



The three parameters 9q, ao, fto describe the ellipticity of the 
two-dimensional PDF for Gaussian noise in Fourier Space and 
thus represent the cornerstones of spectral significance evalu- 
ation. All subsequent relations will be given in terms of these 
three quantities. 

5.4. False-Alarm Probability 

The Cumulative Distribution Function ( CDF) is obtained by in- 
tegrating the PDF (eq.fTBIl according to 



3> (A I to, 6) = [ dA'<f> (A' I co, 9) 
Jo 

which yields 

<D (A I to, 0) = 1 - e~i& . 



(20) 



(21) 
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Thus the probability for an amplitude to exceed a given limit A 
is given by 



<&fa (A | o), ff) = e 



(22) 



which is the False-Alarm Probability of an amplitude level at 
phase 9 (and frequency a>, since 9, 9o, ao, and /3q are frequency- 
dependent quantities). 

5.5. Spectral significance 

The frequency- and phase-dependent False-Alarm Probability of 
an amplitude level was introduced as the probability that ran- 
dom noise in the time domain with the same rms error as the 
given time series produces a peak in the DFT amplitude spec- 
trum which is at least as high as the corresponding amplitude 
level for the time series itself: if a peak is assigned a False- 
Alarm Probability of 0.00001, its risk of being due to noise 
is 1 : 100000. In this section, the spectral significance as a 
more informative quantity is introduced ( 15.5. It . It is the inverse 
False- Alarm Probability (in this case, 100000) scaled logarith- 
mically. In the present example, the conversion of a False- Alarm 
Probability of 0.00001 into spectral significance returns 5. In 
this context, the spectral significance is presented as a logarith- 
mic measure for the number of cases in one out of which the 
considered amplitude would be an artifact. 

Plotting the spectral significance vs. frequency yields the sig- 
nificance spectrum, and the identification and consideration of 
the highest peak in this spectrum may lead to a statement on 
the significance of the entire spectrum. The argument is similar 
to many existing significance estimates (e.g., Scargle 1982): if 
the highest peak in the spectrum is below some limit, the entire 
spectrum has to be considered insignificant. But instead of using 
the (statistically biased) signal-to-noise ratio as a threshold, the 
spectral significance is employed. The application of the spec- 
tral significance concept to the highest peak out of a sample is 
briefly discussed (15.5.2b . 

The formal correspondence to traditional techniques, namely 
signal-to-noise ratio and the Lomb-Scargle Periodogram, is of 
special interest and hence provided subsequently ( 15.5.3ll534l i. 

5.5.1. Definition 

To enhance the compatibility to the popular signal-to-noise ratio 
criterion (see |5.5.3l ), the spectral significance of a DFT amplitude 
is defined as 



sig (A | to, 6) := - log i> FA (A | oj, 9) 



or - using eqs.[T9landl22l- 



sig(A| o),0) 



KA 2 log e 

4(x 2 ) 



cos 2 (6> - 



_6b) sin 2 (0-0 o ) 



(23) 



(24) 



with the normalized axes, ao and fio, as defined by eqs. fT7lfT8l 
and the orientation of the ellipse in Fourier Space according to 
eq.fTOl Since the angle 6\) was chosen to refer to ao as the semi- 
minor axis of the rms error ellipse, ao now corresponds to the 
phase of maximum spectral significance for a given frequency. 

The concept of spectral significance computation relies on 
the analytical comparison of the DFT amplitude generated by a 
measured time series to noise at the same variance as the time 



4 to be distinguished from significance in the sense of a confidence 
threshold as used in hypothesis testing. 



series under consideration. Unless the population variance ^x 2 ^ 
of the noise used for comparison is given by theory and/or other 
observations than the ones under consideration, the sample vari- 
ance (x 2 ^i of the observable may be used as an estimator. 
The Cartesian representation of eq.[24l 



sig (a ZM , b ZM | oj) 



K\oge 



q Z M cos 6*0 + b ZM sin < 
ao 



fl Z M sin go - b ZM cos 6> 



(25) 



is useful for practical applications and will be employed for the 
implementation of statistical weights (see 18.11 ). The subscript 
'ZM' indicates zero-mean corrected time series data, in consis- 
tency with eqs.[3j|4] 

Thanks to the logarithmic scaling, the spectral significance 
appears as a product form (as opposed to an exponential func- 
tion), where one factor (the bracket term) contains all informa- 
tion on the ellipticity of the underlying PDF in Fourier Space 
and is entirely determined by the time-domain sampling. This 
term is scaled according to the squared amplitude. The serendip- 
itous consequence of this separation is that the evaluation of 
the bracket term applies to all datasets with the given sampling. 
In a prewhitening cascade, the sampling of the time series will 
not change. Consequently, the bracket term remains valid for 
the entire sequence and has to be computed only once. In the 
prewhitening cascade itself, it is sufficient to rescale the bracket 
term by the squared amplitude, which speeds up the computa- 
tions considerably. 

Thanks to this formal separation, it is possible to pack all the 
characteristics of the time-domain sampling into an amplitude- 
independent function of frequency and phase. This will lead to 
the Sock Diagram (15. 6t . 

A further practical advantage of this separation is the oc- 
currence of the population variance (x 2 ) independently of fre- 
quency and phase. For small samples, the higher uncertainty of 
the estimated population variance may be overcome by using the 
sample variance (x 2 ^ instead of the population variance (x 2 ^ and 
increasing the spectral significance limit for peak acceptance ac- 
cordingly. 

5.5.2. Spectral significance for a statistically independent 
sample 

One may desire to evaluate the spectral significance for the high- 
est out of a sample of peaks in the significance spectrum - in 
analogy to the procedure presented by Scargle (1982). His argu- 
ments may be applied to the spectral significance directly. 

For a given spectral significance level sig, the probability 
of an amplitude level generated by a noise process to exceed 
the spectral significance limit sig is the False- Alarm Probability 
<Dfa- It is linked to the spectral significance via eq.|23] The com- 
plementary probability that such an amplitude level is below sig 
is 1 - <1>fa- Given a sample of N such amplitude levels, which are 
statistically independent, the probability of none exceeding sig 
is (1 - 0>fa) N ■ Again, the complement, 1 - (1 - Ofa)^, returns 
the probability for at least one peak out of the sample to exceed 
sig. In other words (and using eq.|23]to substitute for Ofa), the 
False- Alarm Probability for the maximum of a statistically inde- 
pendent sample of N peaks is 



fa 



1 - (l - io~ sis f 



(26) 
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This False-Alarm Probability may be transformed into a spectral 
significance (using eq.[23]again), which yields 



sig 



•log 



l-(l-l(T si f 



(27) 



for the spectral significance of the maximum out of a sample of 
N statistically independent peaks in the significance spectrum. 
Solving eq.[27]for sig yields 



Slg : 



•log 



(.-V 



1 - 10- si s 



(28) 



which allows one to immediately convert a chosen threshold for 
maximum spectral significance into 'individual' spectral signif- 
icance (as given by eq.l24b. In most practical applications, the 
approximation 



sig « sig + log N 



(29) 



is sufficiently accurate. 

For an equidistant time series consisting of K data points, 
the number of statistically independent DFT amplitudes is j, if 
K is an even number. One may set N := | as a rough estimate 
also for the non-equidistant case, which performs quite reliably 
in practical applications. 

For example, if a maximum spectral significance threshold 
of 5.46 shall be applied to a time series consisting of 1 000 
data points, the corresponding 'individual' spectral significance 
threshold would be 2.76, which is in good agreement with the 
numerical results by Kuschnig et al. (1997), who obtained a sig- 
nificance of a; 3 in this case, examining 19 300 synthetic time 
series. 



5.5.3. Connection between spectral significance and 
signal-to-noise ratio 

The correspondence between spectral significance and signal- 
to-noise ratio is obtained through substitution of eq.|24]by the 
amplitude signal-to-noise ratio according to eq. IB.lll This 
yields 



sig (A | a), 0) 

7rloge / A 
2 \<A) 



cos 2 (0 - d ) sin 2 (0 - O ) 



(30) 



Given uniformly distributed arguments of the trigonometric 
functions, the expected values of both ckq and fto evaluate to 1 . 
Thus an approximation for the correspondence between ampli- 
tude signal-to-noise ratio and spectral significance is obtained 
by 



sig (A) 



7rloge 



A 



(31) 



For example, an amplitude signal-to-noise ratio of 4 - as a sug- 
gested significance estimator by Breger et al. (1993) - roughly 
corresponds to a spectral significance of 



sig (4 <A» * 4tt log e * 5.4575 . 



(32) 



A numerical simulation for 42 597 time series, each consist- 
ing of 14400 equidistantly sampled data points representing a 
single sinusoidal signal with randomly chosen amplitude, fre- 
quency, and phase, plus Gaussian noise with randomly chosen 



rms error was performed. The agreement with eq.|3T]for spectral 
significances < 200 and a corresponding signal-to-noise ratio of 
w 25 is excellent. At higher spectral significances, the following 
effect has to be taken into account: 

The spectral significance is related to a peak generated by 
noise with the same variance as the observable under considera- 
tion, i. e. without prewhitening. Frequently, a signal-to-noise ra- 
tio calculation relies on the rms residual, i. e. with the peak under 
consideration prewhitened. Since (on average) a sinusoidal sig- 
nal with amplitude A contributes 4p to the variance in the time 
domain, the population variance of residuals in the time domain 
evaluates to 



(33) 



Using this relation, eq. lB.l 11 permits one to calculate an ampli- 
tude noise level with the considered peak prewhitened, accord- 
ing to 



and writing eq.[5T|in terms of N (A) leads to 
log e Kn 



sig (A) 



i-2 



(34) 



(35) 



2 « + 2K [m\ 

Fig.|2]shows the difference in spectral significance for ampli- 
tude noise with and without prewhitening as a function of time 
series length. This difference increases dramatically as datasets 
become shorter. When relying on signal-to-noise ratio estima- 
tion, the issue of prewhitening is crucial and in some way philo- 
sophical. If the considered peak is believed to be 'true' a priori, 
then it will have to be prewhitened for the noise calculation. If 
it is assumed to be an artifact, noise will have to be calculated 
without prewhitening. However, the answer in terms of spectral 
significance is unique and clear: since the spectral significance 
analytically compares the considered time series with a random- 
ized one, i. e. keeping the time-domain rms deviation the same, 
the correspondence to a non-prewhitened signal-to-noise ratio 
would be correct - if desired at all. The computation of the spec- 
tral significance allows one to completely omit the potentially 
ambiguous computation of noise levels by averaging amplitude 
over a frequency interval about a considered peak. In general, the 
latter are not objective, since the resulting noise level depends on 
the choice of the interval width and whether unresolved peaks 
are encountered. 

5.5.4. Spectral significance and Lomb-Scargle Periodogram 

If the influence of the zero-mean correction on the statistical 
characteristics of the Fourier Coefficients is neglected, eqs.[T7l 
[18] simplify to 



ar o (w,0 o ) := 



A)(«,0 O ) := 




2 



K-l 

K V sin 2 (u>t k - O ) 



(36) 



(37) 



respectively. In this case, the evaluation of 0q - as performed in 
AppendixlAl- transforms eq.fTOlinto 



tan 20o (a>) = 



Ef =0 ' sin 2^ 
Xfjo cos 2ut k 



(38) 
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Fig. 2. Spectral significance (sig) associated with an amplitude 
signal-to-noise ratio of 4, without {dashed line) and with (solid 
line) prewhitening of the considered peak, depending on the 
number of time series data points. If the amplitude noise is cal- 
culated after prewhitening, the spectral significance associated 
with a signal-to-noise ratio of 4 decreases below the equivalent 
limit of 5.46 for short datasets. 



and eq.|24] becomes fully compatible to the definition of the 
Lomb-Scargle PeriodograrrQ In this context, the improvement 
of the spectral significance compared to the Lomb-Scargle 
Periodogram is the correct statistical implementation of the ar- 
tificially fixed time series mean. Remembering that the initial 
condition that led to the Lomb-Scargle Periodogram was a least- 
squares solution (unfortunately without handling the zero-mean 
correction appropriately), the significance spectrum will satisfy 
the corresponding least-squares condition for the zero-mean cor- 
rected data as well. This is in perfect agreement with the results 
of simulations, as presented in |6.2| 

Since the Fourier-Space effect of the zero-mean correc- 
tion tends to vanish for infinitely long time series, one would 
expect the difference between the Lomb-Scargle Periodogram 
and the spectral significance to become small (or even negligi- 
ble) for long datasets. The performance of DFT, Lomb-Scargle 
Periodogram, and spectral significance is compared for time se- 
ries of different length in 16.41 

On the other hand, the spectral significance for the Floating- 
Mean Periodogram (Cumming, Marcy & Butler 1999), the 
statistic of which appears not to suffer from zero-mean correc- 
tion problems, is directly obtained by using eq. [24] with a?o, /3q 
and 9o as given above (eqs. [36l[37ll38T l. 

5.6. The Sock Diagram 

In 15.5.11 the spectral significance was introduced as a repre- 
sentation of the statistical properties of time-domain sampling 
in Fourier Space, applied to the amplitude spectrum of a given 
observable (eq.l24b. Selecting a single frequency and phase an- 
gle, one finds the spectral significance to be proportional to the 
squared amplitude. This property makes it easy to determine an 
analogy to the spectral window. 



3 Lomb's (1976) original publication deals with periodogram analy- 
sis in terms of power. The expression of the present results in terms of 
squared amplitude produces explicit consistency. 



For classical DFT-based methods, the spectral window is fre- 
quently used to determine the effects of time series sampling in 
the frequency domain. It is defined as the DFT amplitude spec- 
trum of a constant in the time domain, normalized to an ampli- 
tude of 1 at zero frequency. In the case of non-equidistant sam- 
pling, peaks in the amplitude window indicate periodicities in 
the sampling of the time series corresponding to the frequencies 
where these peaks occur. A frequently returned signature in the 
spectral window of astronomical single-site measurements is a 
set of peaks at integer multiples of Id -1 . This is the Fourier- 
Space representation of periodic data gaps due to daylight and 
termed '1 d -1 aliasing'. 

Since the spectral significance is a more subtle and sensi- 
tive quantity taking into account more information on the time- 
domain sampling, it is possible to introduce a more sensitive 
analogy to the spectral window, the Sock Diagrcm^. As with all 
formalism in terms of spectral significance, it is frequency- and 
phase-resolved. 

In analogy to the spectral window, the Sock Diagram rep- 
resents the spectral significance variations with frequency and 
phase for a constant amplitude, or amplitude signal-to-noise ra- 
tio. It provides quantitative information on the influence by the 
time-domain sampling on the spectral significance. Furthermore, 
it displays the quality of signal-to-noise ratio-based estimation 
for all possible frequencies and phases at a glance. Providing 
both information on gaps in the sampling and frequency regions 
with poor accuracy of the DFT amplitude, the Sock Diagram 
shows where DFT-based signal-to-noise ratio estimation fails. 

The normalization of the Sock Function, 



sock (u>, 0) 



cos 2 (0 - O ) 



sin 2 (0 ■ 



ft 



(39) 



provides an expected value 1 on the assumption of uniformly 
distributed arguments of the trigonometric functions. Thus 



sig (A | a>, 0) 



KA 2 loge 
4(x 2 ) 



sock(w, 0) , 



(40) 



as obtained from eq.[24] permits to compute the spectral sig- 
nificance associated with an amplitude A based on the Sock 
Diagram. In terms of signal-to-noise ratio, eq.[30levaiuates to 



sig (A | a>, 0) 



7rloge 



A 



sock(<y, 0) . 



(41) 



Since the orientation 0q of the elliptical PDF in Fourier Space 
appears only in the form - 0q in all equations related to spec- 
tral significance, phase angles in Figs.[3]and|4]consistently refer 
to the position of the semi-minor axis of the rms error ellipse 
to achieve better visibility: using - 0q instead of provides a 
constant alignment of the spectral significance maxima in phase 
for all frequencies. 

Fig.|3]displays the Sock Diagram for typical non-equidistant 
sampling representing 10 nights yielding 381 data points of 
single-site V photometry of star # 89 in the young open cluster 
IC4996 (Zwintz et al. 2004; Zwintz & Weiss 2006). Regardless 
of the astrophysical dimension, these observations have been 
chosen as the primary test dataset, because they impressively 
show all the characteristics typical for single-site measurements 
that make multifrequency analysis a puzzling task. 

The Sock Diagram uses three-dimensional polar coordi- 
nates: for each frequency, the angular coordinate refers to phase, 



6 The nomenclature is motivated by the shape of the diagrams, if the 
time-domain sampling is close to equidistant (see, e. g., Fig.[4j. 
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Fig. 3. Sock Diagram (in cylindrical coordinates) for the V 
measurements of IC 4996 # 89, displaying the relative variations 
of the spectral significance (radial coordinate) with frequency 
(height coordinate) and phase (azimuthal coordinate) for a con- 
stant signal-to-noise ratio, and hence providing an overview of 
the effect of time-domain sampling properties in Fourier Space. 
Better visibility is achieved by additional color coding, referring 
to the color bar in the lower panel. The susceptibility of DFT 
spectra to 1 d" 1 aliasing shows up in spectral significance varia- 
tions of up to a factor m 35. 



and the radial component is associated with the spectral signifi- 
cance normalized to an expected value of 1 (according to eq.[39l. 
To enhance the visibility, the radial information is color-coded 
additionally. 

The spectral significance variations close to 0.5, 1, 1.5, and 
2 d _1 clearly indicate frequencies where signal-to-noise ratio es- 
timation of False- Alarm Probability is potentially misleading. 

An example for excellent sampling is shown in Fig.|4] repre- 
senting 72 055 MOSlQ data points of f Oph Fabry Imaging pho- 
tometry (Walker et al. 2004; Walker et al. 2005) with a duty cycle 
of * 99.9 %, obtained between May 17 and June 14, 2004. The 
data reduction is performed according to the procedure described 
by Reegen et al. (2006). Even for frequencies close to 14.2 d -1 
- corresponding to the orbital period of the satellite (101.4min; 
Walker et al. 2003) - the spectral significance varies with fre- 
quency and phase by only « 0.1 %. 

7 MOST is a Canadian Space Agency mission, jointly operated by 
Dynacon Inc., the University of Toronto Institute of Aerospace Studies, 
the University of British Columbia, and with the assistance of the 
University of Vienna, Austria. 



Fig. 4. Sock Diagram for MOST ('Microvariability and 
Oscillations of STars'; Walker et al. 2003) measurements of 
£Oph. Due to practically equidistant sampling (duty cycle « 
99.9 %, the relative variations of the spectral significance with 
frequency and phase are « 10~ 3 , even close to the orbital period 
of the satellite (= 14.2 d _1 ). As illustrated by the color bar in the 
lower panel, the color coding scale differs from Fig. [3] consider- 
ably. 



5. 7. The marginal distribution of phase angles 

An alternative method to the computation of spectral signif- 
icance as a function of amplitude and phase simultaneously 
would be to use the phase information separately in addition 
to, e. g., some traditional signal-to-noise ratio-based reliability 
estimate. The idea is that for white noise in the time domain, 
the Fourier phases are not uniformly distributed and that the ad- 
ditional incorporation of the phase information may provide a 
more detailed criterion on the reliability of a peak. 

The overall distribution of phases at a given frequency is im- 
mediately obtained by integrating the bivariate PDF (eq.[T4b over 
amplitude, which yields 



Pi cos 2 (6 - 8 Q ) - a 2 Q sin 2 (6 - 6 ) ' 

normalized for phases 6 on an interval of width n. This avoids 
the ambiguity of solutions for O modulo n, as discussed in 15. 31 
The expected value of this probability distribution is 6q - |: 
phases associated to a low spectral significance (for given am- 
plitude) generally occur more frequently than phases for which 
the spectral significance is high. The special case chq = fto yields 
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Fig. 5. Expected values (solid lines) and rms errors (gray-shaded 
areas) of the probability distribution of phase angles in Fourier 
Space for white noise on the IC 4996 # 89 sampling. 



an upper limit of ^= m 52° for the standard deviation obtained 
by eq.|42] 

In statistical terms, the phase distribution provided above is 
a marginal distribution of the bivariate PDF given by eq.[T4l 
Instead of statistically examining a bivariate distribution, one 
may use the two marginal distributions (in the present case, the 
amplitude and phase distributions) instead, but encountering a 
loss of accuracy, since correlations between the two random vari- 
ables remain unresolved. Therefore, from the theoretical point 
of view, it is advisable to use the bivariate form rather than the 
marginal distributions. An additional problem is that there is 
no analytical solution for the integral over phase, which would 
transform the bivariate PDF into the marginal distribution of am- 
plitudes. One would have to employ classical techniques relying 
on the amplitude signal-to-noise ratio instead. 

In addition to these theoretical objections to the examination 
of marginal distributions, there is a major practical constraint: 
Fig. [3] displays the expected values and standard deviations of 
Fourier phase for frequencies between 1.3 and 2.7 d _1 , given the 
sampling of IC 4996 # 89. In the considered frequency range, the 
rms scatter of phases is greater than 44° for the entire frequency 
range under consideration. Compared to the upper limit of 52°, 
this scatter will probably be much too high to reveal additional 
information on the reliability of a peak, if the marginal distribu- 
tion of phases is considered. 

The PDF of phases provided by eq.|42] are in perfect agree- 
ment to the results of numerical simulations. 

5.8. Spectral significance-based signal recovery 

Practical applications are frequently based on a cascade of con- 
secutive prewhitenings. In this case, the frequency at maximum 
spectral significance, u>, is considered as that of the strongest sig- 
nal component, which is understood to be the next candidate for 
prewhitening. The next step is to determine the best fit for am- 
plitude and phase at co. The least-squares condition for the best 
sinusoidal fit at constant frequency u> is obtained by 

d K ~ l 2 
— J^[x k -A [cos (cbt k -6)- (cos (cbt, - #))]} = , (43) 



^ J] [x k - A [cos (cbt k - 0) - (cos ( cbt, - &))] } = , (44) 



*=<) 



where the term (cos (tbti - &)} = ^ cos [&ti ~ takes into 
account that the discrete fit has to be zero-mean corrected, if 
applied to zero-mean corrected data x k . The derivatives lead to 



2^ x k cos [cbt k - 0j — A I V cos 2 (cot k - 0} 



fc=0 



K 



,k=0 



COS 



(ct)t k - e)]' 



(45) 



x k sin (u)t k - 9} — A I ^ cos (o>t k - fl) sin (u>t k - 9} 

k=0 U=0 

l - [cos [cbt k - 0)] [sin (cbtk -e)] \ . (46) 



K 1 



Solving for yields 



K-l K-l K-l 

2_j 2 XI ^ sin ^ ^ ~ ^ ~ sin 6 ^ tm ~ tk ^ Xk cos ( &f/ ~ v 



k=0 1=0 m=0 



0. 



which finally reduces to 
tan# 



Pi Ef=o cos &h - Pi Ef=o sin &t k 



P\ 2/t=o sin &h ~ Pi 2a=o cos fak 



(47) 



(48) 



using 



Pi := 



K-l 



(K-l 



\ (K-l \ 

K ^ cos <2>t k sin <2>t k - ^ cos cbt k ^ sin cbt k 



k=0 
K-l 



k=0 



U=o 



(K-l 



P2 := K ^ cos 2 o)t k - X cos &>h 

k=0 U=0 

K-l (K-l \ 

P3 : = K ^ sin 2 ut k - ^ sin cbt k 



k=0 



\k=0 



(49) 
(50) 
(51) 



Eq.l48l provides two solutions for 6. To pick the least-squares 
related solution is an easy task for a program. 

Once frequency co and phase 6 of the signal are evaluated, 
eq.|45]immediately yields the best-fitting amplitude, 



A = 



£f = o *k cos [cbt k - 0) 



cos 2 (cbtk -0)- [Efjo cos fak ~ $)] 



(52) 



k=0 



6. Numerical tests 

Two sets of numerical simulations have been performed, the first 
one to confirm the agreement between the theoretically evaluated 
spectral significance and a straight-forward histogram analysis, 
and the second one to quantitatively compare the accuracy of 
frequencies returned by the various methods discussed in this 
paper. 
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Fig. 6. Spectral significance associated with an amplitude signal- 
to-noise ratio of 4 for the V measurements of IC 4996 #89. The 
blue and red graphs refer to orthogonal phases in Fourier Space. 
The expected spectral significance of m 5.46 is displayed by the 
dashed black line. The vertical dashed-dotted green line indi- 
cates the frequency of 1.956d _I , which was selected for a nu- 
merical simulation to check the validity of the theoretical ap- 
proach. 

6. 1 . Comparison of analytical and numerical solutions 

Extensive numerical simulations have been performed in order 
to examine the validity of the above theoretical considerations. 
The simulations are set up in a way that compares closely to real 
life. The algorithm to generate Gaussian noise is based on the 
Central Limit Theorem (Stuart & Ord 1994, pp. 310f), providing 
fast convergence of a mean value of uniformly distributed ran- 
dom variables towards a Gaussian distribution with increasing 
number of variables. All of the following numerical applications 
rely on Gaussian noise produced by summing up 10 values from 
the random number generator. A comprehensive compilation on 
alternative methods to generate Gaussian noise is provided by 
Firneis (1970). 

An all-in-one simulation for many frequencies, many phases, 
and many amplitudes would result in tremendously time- 
consuming computations. In order to keep the effort reasonable, 
only a single frequency is picked to examine the phase depen- 
dency of the spectral significance for different amplitudes. 

Fig. [6] displays the spectral significance associated with an 
amplitude signal-to-noise ratio of 4 - according to eq.[30]- for 
the V data of IC 4996 #89. The blue and red graphs represent 
two orthogonal phases in Fourier Space. The comparison of nu- 
merical and theoretical results is performed for a frequency of 
1.956d _1 , where the deviation of spectral significances from the 
expected value 5.46 is a; 5 for selected phase angles. Since it is 
desired to examine the presence of spectral significance varia- 
tions with phase in numerical results where predicted by theory, 
this frequency is a reasonable choice for a test. 

The procedure consists of five steps. 

1 . Zero-mean Gaussian noise with a standard deviation of 1 is 
imposed upon the IC 4996 #89 sampling. 

2. The zero-mean correction is performed to avoid scatter in the 
mean of the finite time series about the population mean (see 

ED. 

3. The Fourier-Space phase angle for the synthetic time series 
at 1.956d~ 1 is evaluated. 

4. The Fourier-Space noise is evaluated upon the variance of 
the time series according to eq. lB.l 11 
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Fig. 7. Phase-dependent spectral significance (radial coordinate, 
referred to by the vertical axis) for the V measurements of 
IC 4996 #89 at a constant frequency of 1.956d~' as a function 
of phase, referring to an amplitude signal-to-noise ratio of 1 . The 
solid line represents the theoretical result. A numerical simula- 
tion for 1 .5 million synthetic datasets (dots) - counted in phase 
bins of width 1 - illustrates the excellent agreement. The sys- 
tematically higher deviation of the numerical results from the 
theoretical solutions at higher spectral significances is due to 
sample-size effects: phase bins associated with a high theoret- 
ical spectral significance are hit less often than others. Hence the 
total number of Fourier Amplitudes to be examined differs from 
phase bin to phase bin. The dashed line represents the expected 
spectral significance of w 0.34 associated with a signal-to-noise 
ratio of 1 for uniformly distributed arguments of all trigonomet- 
ric functions. 

5. If the Fourier Amplitude exceeds the signal-to-noise ratio 
under consideration, the resulting histogram is updated cor- 
respondingly. 

The number of amplitudes exceeding the preselected limit rela- 
tive to the total number of synthetic datasets provides an estima- 
tor for the False-Alarm Probability (and consequently spectral 
significance) associated with the chosen signal-to-noise ratio. 

Figs. [7] to [9] display the agreement between theory and sim- 
ulations for amplitude signal-to-noise ratios of 1, 2, and 3. The 
corresponding numbers of synthetic datasets computed are 1.5 
million, 22 million, and 250 million, correspondingly. In all 
three cases, phase bins of width 1° were used for the histograms. 
For a signal-to-noise ratio of 4, the number of synthetic datasets 
required for an acceptable numerical accuracy would exceed the 
capabilities of computational performance, especially those of 
system random number sequences, by far. 

Phases associated with a high spectral significance occur less 
frequently than others (see 15. 7b . whence the scatter of numerical 
results is systematically higher at higher spectral significance 
levels due to sample-size effects. Taking this into account, the 
overall quality of fits is good. In neither of the three plots, a 
systematic deviation of the numerical results from the analyti- 
cal functions is visible. However, the theoretical prediction for 
the overall shape is recovered by the numerical results, indicat- 
ing that the ellipticity given by the normalized axes ao and /3q 
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Fig. 8. Same as Fig. [7] but for an amplitude signal-to-noise ratio 
of 2. The dots refer to a numerical simulation for 22 million 
synthetic datasets. The expected spectral significance level is » 
1.36 (dashed line). 



90 




Fig. 9. Same as Fig. [7] but for an amplitude signal-to-noise ratio 
of 3. The dots refer to a numerical simulation for 250 million 
synthetic datasets. The expected spectral significance level is » 
3.07 (dashed line). 

matches the 'reality' of simulation. Also the orientation of the 
ellipse (6q) appears consistent with the synthetic data. Finally, 
the consecutive comparison of Figs. [7] to [9] offer convincing ev- 
idence of the spectral significance at a constant frequency and 
phase angle to be proportional to the squared amplitude, as pre- 
dicted by the theoretical solution. 

6.2. Accuracy of peak frequencies 

One of the major issues in period searches is the accuracy of 
the resulting signal frequencies. Classical DFT with or without 



improvement of peak frequencies by a subsequent least-squares 
fiO Lomb-Scargle Periodogram, and spectral significance analy- 
sis are based on the identification of the highest peak in a chosen 
frequency interval. 

A quantitative description of the quality of resulting frequen- 
cies with and without noise was obtained by means of simula- 
tions. For the sampling of the IC 4996 #89 dataset (V), the fol- 
lowing procedure is performed: 

1 . a synthetic signal of given frequency fo and amplitude with 
random phase (uniformly distributed on [—n, n]) is gener- 
ated, 

2. Gaussian noise with given standard deviation is added (op- 
tional), 

3. zero-mean correction of the resulting dataset is performed, 

4. a DFT amplitude spectrum, a Lomb-Scargle Periodogram, a 
spectrum of phase dispersiorQ, and a significance spectrum 
are computed for a predefined frequency range, 

5. an interpolation routine is performed to find the maximum 
(or minimum phase dispersion, respectively) in each of the 
four spectra, and 

6. the deviation of the resulting frequency from the correspond- 
ing input frequency, Af, is evaluated. 

For each frequency, deviations Af were collected to obtain a 
frequency-dependent rms error of recovered frequencies, where 
only attempts with |A/| < 0.5 d _I were taken into account. 
Attempts resulting in |A/| > 0.5 d _I were considered alias. 

The example illustrated by Figs.[lj3[TT]uses the IC 4996 # 89 
data. 

Fig. [10] shows the comparison of the five methods. For each 
frequency, 1 000 datasets were examined. The figure displays 
the rms deviation (denoted |A/|) of resulting frequencies as a 
function of the signal frequency fo. The top panel refers to a 
clean sinusoidal signal without noise. Towards the bottom panel, 
Gaussian noise with increasing standard deviation is added (cor- 
responding to signal-to-noise ratios of 40 and 4, respectively, 
using eq.!55V 

For a signal without noise (top panel), the frequency ac- 
curacies of the Lomb-Scargle Periodogram exceed that of the 
DFT by a factor w 10, and both profiles show accuracy varia- 
tions with frequency. The Lomb-Scargle Periodogram does not 
take into account zero-mean correction, as shown in 15.5.41 This 
leads to systematic effects not only for short datasets, but also 
in frequency regions where the phase coverage becomes poor. In 
the absence of noise, frequencies at maximum spectral signifi- 
cance are « 100000 times more accurate than the DFT results, 
which is in the accuracy domain of the least-squares solutions. 
Apparently the accuracy of the spectral significance peaks is 
only limited by the internal accuracy of the computer for double- 
precision floating-point numbers. In addition, the accuracy of 
spectral significance is, in this respect, practically independent 
of frequency. 

Once noise is added to the signal, the peak frequency ac- 
curacy of spectral significance becomes much poorer (in consis- 
tency with O'Donoghue & Montgomery 1999), but as illustrated 
by Fig. [10] the method does not get worse than either alterna- 
tive procedure. The improvement of the frequency accuracy by 
the spectral significance solution for high signal-to-noise ratio is 
valuable, since the exact knowledge of frequency provides exact 

8 DFT suffers from systematic deviations of peak frequencies 
(Kovacs 1980). 

9 The PDM tested here is based on 300 equidistant phase bins of 
constant width rad. 
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Fig. 10. Frequency accuracy (rms scatter of resulting frequencies about the initial frequency for a single sinusoidal signal with 
uniformly distributed phase) vs. signal frequency of five methods: DFT (solid orange line), DFT plus least-squares fitting (solid 
blue line), Lomb-Scargle Periodogram (solid green line), PDM (dotted black line), and spectral significance (solid red line). The 
time-domain sampling represents the V measurements of IC 4996 #89. The top panel is the frequency accuracy for a pure signal. 
Towards the bottom panel, Gaussian noise with increasing standard deviation is added to the signal in two steps, corresponding to 
amplitude signal-to-noise ratios of 40 and 4, respectively. Only those attempts where the distance between resulting frequency and 
input frequency does not exceed 0.5 d -1 were taken into account, the rest was considered alias (see |6.3t . The results are based on a 
numerical simulation investigating 1 000 datasets for every frequency. 



prewhitening. Only exact prewhitening guarantees that weaker 
frequency components are not contaminated by spurious residu- 
als of the higher peaks. 



6.3. Aliasing 

One potential weakness of the DFT, which usually becomes puz- 
zling in the analysis of non-equidistantly sampled time series, 
is the susceptibility to aliasing. Whereas measurements over in- 
finite time would lead to perfectly delta-shaped signal compo- 
nents, the spectral window of 'real' data is convolved into this 
'ideal' spectrum. For typical single-site observations, 1 d -1 side- 
lobes are visible around every signal peak, and in many cases it 
is hard to decide which peak out of the 'comb' of aliases is the 
right one. For multiperiodic signal, aliases of individual com- 
ponents may interfere. If this interference leads to an amplifica- 
tion, an erroneous component identification is inevitable, when 
only relying on the highest amplitude. This applies to the spec- 
tral significance as well and reflects a major weakness of the 
step-by-step prewhitening technique rather than the calculation 
of the spectral significance. Strategies to overcome the aliasing 
problem have been examined for many years (e. g. Ferraz-Mello 
1981; Roberts, Lehar & Dreher 1987; Foster 1995). 



Fig.rm displays a comparison of DFT, Lomb-Scargle 
Periodogram, PDM, and spectral significance in terms of frac- 
tion of aliases among the 1 000 simulated test datasets used in 
16.21 As in Fig. [10] the top panel refers to signal without noise, 
and Gaussian noise of increasing standard deviation is added to- 
wards the bottom panel. Not surprisingly, the susceptibility of 
the DFT to aliasing does not improve, if a least-squares algo- 
rithm is appended. The portion of aliases obtained using the 
Lomb-Scargle Periodogram is fairly below that of the DFT. 
Finally, the figure clearly shows that the spectral significance 
analysis is more stable against aliasing than either compared 
strategy. Without noise, not a single alias peak occurs among 
altogether 300000 simulated datasets. For a signal-to-noise ra- 
tio of 40, the percentage of alias peaks is less than a third of 
the corresponding percentage of the alternative methods, even at 
Id -1 . Only for the highest noise level, to which the bottom panel 
refers, all results are quite comparable. 

Since the spectral significance is not initially intended to 
correct for aliasing, its capabilities to avoid potential misiden- 
tification of peaks are limited to the extent of spuriously am- 
plified peaks by systematic inhomogeneities of the frequency- 
domain noise. Simulations and practical experience show that 
the systematic errors of spectral noise are an inferior error source 
compared to the interference of aliases of different signal com- 
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Fig. 11. Relative number of aliases vs. signal frequency of four methods: DFT plus least-squares fitting (dashed blue line), Lomb- 
Scargle Periodogram (solid green line), PDM (dotted black line), and spectral significance (solid red line). In this context, a result 
is considered alias, if the absolute difference between resulting frequency and signal frequency exceeds 0.5 d -1 . The DFT without 
least-squares fitting is not displayed here. It produces essentially the same fraction of alias, because the least-squares fit allows a fine 
tuning of resulting peak frequencies only, which is negligible considering frequency errors of 0.5 d -1 and more. The time-domain 
sampling represents the V measurements of IC 4996 #89. The top panel is the fraction of alias peaks for a pure signal. Towards 
the bottom panel, Gaussian noise with increasing standard deviation is added to the signal in two steps, corresponding to amplitude 
signal-to-noise ratios of 40 and 4, respectively. In the upper two panels, the DFT graph is offset vertically by -0.01 and the Lomb- 
Scargle graph by +0.01 to provide better visibility at and close to zero. The results are based on a numerical simulation investigating 
1 000 datasets for every frequency. 



ponents. Presently investigations are performed to apply the 
spectral significance technique to simultaneous multi-frequency 
solving algorithms for an improved treatment of aliases. The re- 
sults are planned to be presented in paper II. 



6.4. Sample size effects 

A further example is provided to compare the performance of 
DFT, Lomb-Scargle Periodogram, and spectral significance for 
datasets of equal (or at least comparable) characteristics, but dif- 
ferent length. 

The 'long' dataset represents 34 nights (not consecutive, 
but covering 8 1 d) of single-site photometry of the Delta Scuti 
star 44Tau (Str0mgreny, 2 981 data points; Antoci et al. 2006) 
acquired by the Vienna University Automatic Photoelectric 
Telescope (APT; Strassmeier et al. 1997; Granzer et al. 2001). 
More details on the data are provided in Section|7] The 'short' 
dataset is a subset of 7 consecutive nights (619 points). For 
the subsequent investigations, only the sampling of these two 
datasets is used. Peak frequency accuracies are computed ac- 
cording to the procedure described in 16.21 



Fig. [12] shows the frequency accuracies of the compared 
methods for the short dataset in the top panel. The display is ex- 
actly according to Fig.[l0] The bottom panel contains the iden- 
tical analysis of frequency accuracies for the long dataset and 
indicates an improved overall precision of the DFT and Lomb- 
Scargle Periodogram, if the number of data increases. 

However, 1 d -1 aliasing persists also for the long time se- 
ries (Fig.ll3t. which is compatible with the persisting alias peaks 
in the spectral window at integer multiples of Id -1 . Since the 
zero-mean correction is represented by the subtraction of a con- 
stant from all observables in the entire time series (referring to 
eqs.[3] @), it is not surprising that the techniques which ignore 
the statistical effects of zero-mean correction retain these spec- 
tral window-related weaknesses also in case of a large number 
of data points. 

The result of spectral significance calculations is not pro- 
vided in these graphs: the deviation of frequencies is practi- 
cally equal to zero independently of frequency - as for the 
IC 4996 # 89 data (top panel in Fig.[l0>. 
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Fig. 12. Frequency accuracy (rms scatter of resulting frequen- 
cies about initial frequency for a single sinusoidal signal with 
uniformly distributed phase) vs. signal frequency of DFT (or- 
ange) and Lomb-Scargle Periodogram (green). The simulated 
time series represent pure signal without noise at uniformly dis- 
tributed phase angles. The top panel represents the frequency 
accuracy for seven consecutive nights of single-site photometry 
of 44Tau (Str0mgreny, 619 data points). These data are a subset 
of the 8 1 days long time series presented in the bottom panel (34 
nights, 2981 data points). The plots illustrate the improvement 
of the overall frequency accuracy of both methods for the longer 
dataset. The corresponding deviations obtained using the spec- 
tral significance are practically zero, similar to the top panel in 
Fig. [10] and thus not presented here. Only those attempts where 
the distance between resulting frequency and input frequency 
does not exceed 0.5 d _I were taken into account, the rest was 
considered alias. The results are based on a numerical simula- 
tion investigating 1 000 datasets for every frequency. 



7. SigSpec: Practical application 

The formal concept of spectral significance does not only 
provide reliable information on the sampling characteristics in 
the time domain, but also allows to include the computations 
into a prewhitening sequence. SigSpec realizes the entire 
procedure as a high-performance algorithm. Implementations 
for various operating systems and CPUs for free down- 
load at http://www.astro.univie.ac.at/SigSpec 
The ANSI-C source code is available on request 
(reegen@astro . univie . ac . at). 

The SigSpec technique was applied to numerous datasets and 
has proven its value also in other scientific problems. A practi- 
cal example for a situation where SigSpec performs superior to 
classical DFT-based signal-to-noise ratio estimation is given in 
TableQ] Columns 1 and 2 represent 13 identified eigenfrequen- 
cies plus 16 excited linear combinations of the 6 Set star 44Tau 
(Antoci et al. 2006). The corresponding Str0mgren v and y am- 
plitudes as obtained from a multisite campaign in 2001/02 are 
displayed in columns 3 and 4. The underlying light curves con- 
sist of 3 890 points in v and 3 582 points in y. For the analysis of 
the campaign data, the signal-to-noise ratio threshold for peak 
acceptance was chosen to be 3.5. 

The major part of the data (3 280 points in v, 2 981 
points in y) was acquired by the Vienna University Automatic 
Photoelectric Telescope (APT; Strassmeier et al. 1997; Granzer 
et al. 2001). The data represent 34 nights covering a time inter- 
val of 8 1 d. As a comparative test of the practical performance of 
SigSpec, the APT data alone were analysed using a DFT-based 
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Fig. 13. Same as Fig. [12] but displaying the relative number 
of aliases vs. the signal frequency of DFT (solid orange) and 
Lomb-Scargle Periodogram (dashed green). In this context, a re- 
sult is considered alias, if the absolute difference between result- 
ing frequency and signal frequency exceeds 0.5 d _I . The spectral 
significance does not produce a single alias maximum in the sim- 
ulations, similar to the top panel in Fig.QT] The corresponding 
graphs are thus not presented here. The DFT graph is offset verti- 
cally by -0.01 and the Lomb-Scargle graph by +0.01 to provide 
better visibility at and close to zero. 

prewhitening sequence relying on a signal-to-noise ratio limit 
of 4. Except for the higher threshold, this is exactly the same 
technique as applied to the campaign data. In any case, the fre- 
quencies found in the APT subset were cross-identified with the 
frequencies published for the full dataset (Antoci et al. 2006). 
The result for the APT data is displayed in columns 5 and 6. In 
v, 20 of the 29 signal components are found, in y only 15. With 
each filter, 1 d _I aliasing is found for 2 frequencies, which are 
indicated by italc print. 

Columns 7 and 8 refer to the corresponding SigSpec analysis 
using a spectral significance limit of 5.46 and reproducing all 
frequencies except for one in v and 5 in y. The number of aliases 
is 8 in v and 4 in y, but the majority of aliases in the SigSpec result 
occurs for frequencies not resolved by the alternative technique 
at all. 

For additional practical examples, the reader is referred to 
Reegen (2005). 

8. Further Topics 

8.1. Spectral significance for statistically weighted time 
series 

Astronomical measurements are generally influenced by instru- 
mental and environmental conditions changing with time. Thus 
different accuracies for different data points are involved, which 
are frequently desired to be taken into account by applying 
weights to the observables. If the accuracy is poor, the corre- 
sponding weight is low, and high-accuracy data points are as- 
signed a high weight, respectively. Furthermore, multisite cam- 
paigns employ different telescopes with different instrumental 
parameters, which may also require an appropriate weighting. 

This section refers to a set of statistical weights, jk, k = 
0, ...K— 1, normalized according to 

K-\ 

J]y k =:K. (53) 

jfc=0 
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Table 1. Frequencies detected in multisite data of 44Tau, ac- 
cording to Antoci et al. (2006). A,, and A, are the published am- 
plitudes (mmag) in Str0mgren v and y for the 2001/02 multisite 
data (3 890 points in v, 3 582 points in y). The corresponding 
columns indicated by 'S/N' contain a cross-identification for the 
data obtained by the Vienna University Automatic Photoelectric 
Telescope (Strassmeier et al. 1997; Granzer et al. 2001) in the 
course of this campaign (3 280 points in v, 2 98 1 points in y). An 
amplitude signal-to-noise ratio > 4 was used. The columns in- 
dicated by 'sig' represent the cross-identification of the Vienna 
APT data with the complete campaign dataset using the result 
of the SigSpec analysis, based on a spectral significance limit of 
5.46. Amplitudes in italic print refer to 1 d -1 alias peaks. Since 
the peaks for the APT subset were assigned to corresponding 
peaks in the total dataset, individual frequencies are not dis- 
played. 
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Weighting may be considered as counting an item £j of, e. g., 
an arithmetic mean repeatedly instead of only once. If the num- 
ber of counts for an arbitrary & is n^, then the arithmetic mean 
over all k = 0, 1, ...,K— 1 is expressed as 



<&> 



2f= n k 



(54) 



In this case, eq.|53]demands the normalized weights to be intro- 
duced according to 



Xf=0 n k 



(55) 



These arguments may consistently be followed to obtain the sub- 
sequent relations (eqs.l56ltol62l. 

The implementation of statistical weights has to be per- 
formed for all elements of the spectral significance relation 
(eq.[24] or|25] respectively). 



1 . The weighted mean value to be subtracted from every data 
point is calculated according to 



1 K ~ l 

*=0 



YkXk 



(56) 



Further considerations assume Xk to be already zero-mean 
corrected. 

2. Also the weighted mean variance of the data, 



1 f=| 

k=0 



(57) 



can be used as an estimator for the variance of the corre- 
sponding population. 
3. The DFT is determined by the weighted Fourier Coefficients 



flZM ( 



1 

i(w„) := — ^ ykx k cos co n t k , 

it=0 

1 K ~ x 

bxmioJn) ■= — ^ykXk sin a> n t k . 

k=Q 



(58) 



(59) 



4. The parameters characterizing the sampling properties may 
be generalized according to 



Oq (w, 0q) = 



K 2 



K-\ 



K^y k cos 2 (tot k -0 O )- ^ Jk cos (coti - G ) 
2 



fe=0 



K-\ 



k=0 



, (60) 



Pl(u,Oo) = 



K-l 



K 2 



[ k=0 [ k=0 

tan 26o (to) 

■ K-l K-l 

K ^7k sin 2ut k -2^y k cos cot k ^ y k sin a>t k 
k=0 k=0 i 

K-l (K-l 

K 2^ Jk cos 2u>t k - ^ Jk cos a>t k 



(61) 



k=0 
(K-l 



k=0 
\2 



U=o 



^ y k sin cot k 
U=o 



(62) 



These parameters provide eq.|25]to return the weighted signifi- 
cance spectrum of the dataset. 

In the case of jk =: 1 Vfe, these relations consistently reduce 
to eqs.[l0] [T7J [18] for unweighted analysis, respectively. Given 
the above re-definitions, eq.|24]and the corresponding considera- 
tions - conversion between signal-to-noise ratio and signficance 
( 15.5.31 ) and Sock Diagram ( 15.6b - readily apply to weighted time 
series as well. 



8.2. Spectral significance for zero-mean corrected subsets 

In some cases one may demand to correct the mean magnitude 
for individual subsets instead of the entire time series, i. e. if the 
data were obtained by different instruments, or if the subtraction 
of nightly averages is performed. In this case the influence on the 
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statistical description of the noise to which an amplitude level is 
compared has to reflect the subdivision correspondingly. Given 
a subset index, s = 1 , 2, S , where S denotes the total number 
of subsets, eqs.[3]0have to be re-written according to 



^ ^ x sk cos ojt sk - Xsk Yj C0S <Jjtsl 



a Z M («) = 
K 

ls=l V fc=0 



K s — 1 AT.,-1 



1=0 



K.-l K.-l 



^ *s* sin sA - — 2^ x S i ^ sin wf. s/ 



i=0 



/=() 



(63) 



(64) 



In this context, each registration time and observable is assigned 
two indices, writing x S k- The first index refers to the subset, and 
the second index is the identifier of the data point within the 
subset. The number of data points within the subset s is denoted 
by K s , and finally K := £j=i K s for the total number of data 
points in the entire time series. 

Further calculations follow the procedure described in 
Section|5]exactly and yield 
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(65) 



for the orientation of the rms error ellipse, in analogy to eq.fTOl 
Correspondingly, the normalized axes of the ellipse (eqs. [T7l[T8] l 
evaluate to 



a (to, 9 ) = 



1 



'2 s ( Ks ~ l 

cos2 (wfs * ~ 0o) 

K s -1 ] 2 

^ cos (cot s i - 6 ) 



(66) 



Po(to,e ) = 



(K s -1 



1 



— J] \ ^ Sin2 ^ Wf ^ - O) 
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^ sin (wf S ( - 6» ) 
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2 U 



(67) 



8.3. Colored noise 

SigSpec does not take into account colored noise. Since there 
are both instrumental effects (e. g. CCD readout, stability of the 
spacecraft position) and stellar variations (e. g. low-amplitude 
modes, granulation noise) to be considered and neither of these 
two sources may be determined unambiguously, it is presently 
impossible to deduce a reliable amplitude noise profile for mea- 
surements in many cases. 

The heuristic approach to generate a noise spectrum by 
means of (weighted) moving averages suffers from the presence 
of unresolved peaks, which increases the risk to miss intrinsic 
signal. Hence it is advisable to use SigSpec for the detection of 
significant sinusoidal signals and to take effects due to colored 
noise (however the latter may have been determined) into ac- 
count by choosing different spectral significance thresholds for 
different frequency regions. 



However, there is evident demand for further investigation 
in the presence of colored noise. The statistical description of 
colored noise and its implementation into the evaluation of the 
spectral significance is the subject of present and ongoing inves- 
tigation and will be discussed in a dedicated publication (paper 
III). 



9. Conclusions 

SigSpec exceeds the diagnostic capabilities of Discrete Fourier 
Transform, Lomb-Scargle Periodogram, Phase Dispersion 
Minimization, and least-squares fits in various respects. 

1. SigSpec does not compare one peak amplitude to another. 
Thus it avoids implications to the statistical independence 
of peaks in the amplitude spectrum at a single prewhitening 
stage and the ambiguity in defining a Nyquist Frequency, 
which all compared methods suffer from. The implicit 
comparison of peaks by selecting the dominant one for 
prewhitening may be performed free of the frequency and 
phase dependencies of Fourier amplitudes, if the spectral sig- 
nificance is used instead ( 16.11 ). One remaining issue in this 
context is the effect of multiple hypothesis testing on step- 
by-step identification and prewhitening. The propagation of 
uncertainties from one prewhitening step to another will be 
a topic in Paper II examining the DFT on multi-periodic sig- 
nals. 

2. SigSpec is based on an analytically straight determination of 
the amplitude Probability Density Function ( 15.3b . Provided 
the noise is white (but not necessarily Gaussian), SigSpec re- 
turns exact results instead of signal-to-noise ratio estimates 
( 16. 11 1. A more realistic approach is the introduction of col- 
ored noise in the sense of serially correlated measurements. 
Paper III is planned to discuss the appropriate incorporation 
of the serial correlation into the spectral significance compu- 
tation. 

3. SigSpec is the first technique in astronomical time series 
analysis to use both frequency and phase angle in the compu- 
tation of the False-Alarm Probability (Section|5]l, appropri- 
ately encountering the fact that the amplitude noise level is 
systematically different for different frequencies and phases 
in Fourier Space d6.2t . The statistics employed by SigSpec 
takes into account all available information provided by the 
Discrete Fourier Transform. 

4. The performance of SigSpec for a single signal plus noise 
is discussed in this paper ( 16.21 16.31 ). For strong signal com- 
ponents, the peak frequencies returned by SigSpec are con- 
siderably more accurate than the solutions of all compared 
methods. For weak signal components, SigSpec still provides 
a slightly higher accuracy. Furthermore, there is indication 
that the SigSpec frequencies are at least as accurate as the 
least-squares fits. The systematic examination of multiperi- 
odic signals and a comparison to non-linear least-squares fits 
will be provided in paper II. 

5. Signal-to-noise ratio estimation provides a valid significance 
estimate, but becomes very poor for low frequencies and in 
case of periodicities in the time-domain sampling d6.4| i. 

6. Signal-to-noise ratio-based period detection is a matter of 
experience and - to some extent - personal taste. SigSpec 
provides a solution that is completely free of subjective in- 
fluence. Interpretation is no longer a part of the analysis, and 
the results remain exactly the same, if two different persons 
apply the calculations to the same data. 
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Tests on synthetic and real data confirm that the SigSpec analysis 
is superior to signal-to-noise ratio estimates (Section[6]). 

The Sock Diagram (15.6b . as a generalized analogy to the 
spectral window, provides increased information on the proper- 
ties of the time-domain sampling in Fourier Space, since incor- 
porating phase-resolution as well. Periodicities in the sampling 
of a time series are recovered by the spectral window and provide 
a measure of the susceptibility of DFT to aliasing. In addition, 
the Sock Diagram indicates frequency and phase regions with 
high systematic errors of DFT amplitudes. Both error sources 
may cause the classical signal-to-noise ratio estimation to fail. 

The incorporation of statistical weights into the spectral sig- 
nificance calculation is provided (18. U . as well as the modifica- 
tion of the spectral significance computation, if a time series is 
assumed to consist of several zero-mean corrected subsets d8.2| i. 
Both features also implemented in the present software version. 

The SigSpec software has proved its validity and excel- 
lent performance in numerous practical applications, mainly in 
connection with the data from the MOST satellite (Walker et 
al. 2005; Guenther et al. 2005), but also for the photometric 
search for new pre-main sequence Delta Scuti stars in young 
open clusters (Zwintz et al. 2004; Zwintz & Weiss 2006). In ad- 
dition, SigSpec was successively employed for the photometric 
and spectroscopic determination of the rotation periods of se- 
lected roAp stars (Sachkov et al. 2004; Ryabchikova et al. 2005). 

However, although the statistically unbiased approach used 
by SigSpec improves the reliability of results considerably, the 
software does not provide an invitation to blind belief. It success- 
fully separates the detection of formally significant sinusoidal 
signal components in a time series from their interpretation in a 
physical context, nothing more. The practical application of the 
program still requires a careful examination of results, namely 
the decision whether a formally significant frequency is related 
to the observed physical process or an (e. g. instrumental) arti- 
fact. 
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Appendix A: Orientation of the rms Error Ellipse of Fourier Coefficients 

Rotating the coordinate system in Fourier Space by a phase angle ft transforms ozm, £>zm from eqs.[5][6]into 

k-i r . k-i 

K ^ x k cos (cot k - ft) - — 
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a {co, ft) = i ^ x k cos (cot k - ft) - ^ cos (coti - ft) 

fc=0 L /=0 

1 * 1 f 1 K 1 

/? (w, ft) = — 2^ sin (cot k - ft) - — 2^ sin ( wf / - o) 



(A.l) 



(A.2) 



In terms of the population variance (x 2 ) of the time-domain data x k , the population variances of a and /3 are 

/ x 2\ C K-l [K-l 

(a 2 ){aj,e ) = I-^jtf^cos^w^-ft)- J] cos (<^- ft) 



it=o 

A'-l 



fc=0 



./=0 
A-l 



(/3 2 )(a),e ) = ^/jtfj]sin 2 («r t -ft)- ^sin^-ft) 



The covariance 
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vanishes for a random population, if 

k-i r , k-i 

cos (cot k ~ o)~ 

A=0 L 1=0 "" /=!) 

The constant ft may be separated from the sums, which yields 

I K K K \ 
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(A.6) 



cos Uq sin «o 



t=0 (t=0 




7 




f K \ 


2 


IK \ 


2 


^ a cos 


+ 


y ^ sin cjt k 








U=o 





(A.7) 



where it is not necessary to distinguish between indices k and I any more. Eq. lA.7l mav be expressed in terms of 2ft and 2cot k , 
respectively, according to 



cos 2 ft 



sin 2 1 
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K sm 2a>t k - 2 ^ cos tot k ^ sin 



cot k 



k=0 
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U=o 





(A.8) 



This immediately leads to eq.fTOl 

Appendix B: PDF Transformation 

B. 1. General PDF transformation 

According to, e.g., Stuart & Ord (1994), pp. 20ff, the PDF conversion between a random variable x and a random variable y 
connected via y := y (x) is given by 



: (X) = 4>y 00 



For multivariate distributions, the generalization of differentials yields 

(f>x (x) = <p y (y) 



(B.l) 



(B.2) 
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with the Jacobian 



Syi dy 2 dy, 

dx\ dx] dx\ 

dyt dy 2 dy 3 

dx 2 dx 2 dx 2 

dy\ dy 2 dyi 

dx^ dx^ dxj 



(B.3) 



B.2. Example: transformations of PDF and expected value for the exponential distribution 
If y is an exponentially distributed random variable with the PDF 

v (y)=:*e-*\ (B.4) 
eq. lB.ll for x :- ^Jy evaluates to 

cf> x (x) = 2/ae - ** 2 , (B.5) 

where both x, y are considered positive semi-definite. 
The expected value of y is 

(y)=: f dyy<t> y (y) = - , (B.6) 

Jo K 

as obtained from eq. IB.4l Correspondingly, eq, IB.5l vields 

/ 71 

dxxcf> K (x)= \\— , (B.7) 

o 

so that 



2k' 



(y) = - (xf (B.8) 

describes the transformation of expected values corresponding to y — x 2 . 

For equidistantly sampled noise Xk in the time domain, the power spectrum associated with the Fourier Series is exponentially 
distributed (Schuster 1898; Scargle 1982) according to 

where (x 2 j denotes the population variance of the noise. The previous considerations apply through the substitution k =: 5^y> 

normalized to single-sided power spectral density (which introduces the factor 2). According to eq. IB.6l the noise level in the 
spectrum of squared amplitudes evaluates to 

, * 2(x 2 ) 

^ = K~ • (R10) 
For the expected value of the amplitude (i. e. the amplitude noise level), eq, IB.7l vields 



<A>=y|<^ 2 >. (B.ll) 
Eq. lB.8l leads to the conversion between the expected values of amplitude, (A) and squared amplitude, \A 2 y according to 

(A 2 ) = -<A> 2 . (B.12) 

This is in agreement with the observational results by Home & Baliunas (1986). 

An amplitude signal-to-noise ratio of 4 - introduced as an approximate significance criterion by Breger et al. (1993) - corre- 
sponds to a power signal-to-noise ratio of 4 n m 12.57. 
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